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<110> APPLICANT : Kuramitsu, Seiki 

Yokoyama, Shigeyuki 
<120> TITLE OF INVENTION: Novel DNA Repair Enzymes, 

Enzymes And Methods of Using Them 
<130> FILE REFERENCE: 11283 -013001/PH1261 US 

11 <140> CURRENT APPLICATION NUMBER: US 09/938, 901A 

12 <141> CURRENT FILING DATE: 2001-08-24 
<150> PRIOR APPLICATION NUMBER: JP 47762/2001 
<151> PRIOR FILING DATE: 2001-02-23 

17 <160> NUMBER OF SEQ ID NOS : 65 
19 <170> SOFTWARE: Patent In version 3.3 
21 <210> SEQ ID NO: 1 
<211> LENGTH: 975 
<212> TYPE: DNA 

Thermus thermophilus 



14 
15 



22 
23 



Nucleic Acids Encoding DNA Repair 



****** 




24 <213> ORGANISM: 
27 <220> FEATURE: 



_ _NTI 

m 




28 


<221> NAME /KEY : 


CDS 


























29 


<222> LOCATION: 


(1) 


. . (975) 






















31 


<4 00> SEQUENCE: 


1 


























32 


atg 


gag 


gee 


tgg 


egg 


aaa 


gcc 


etc 


etc 


gcc 


tgg 


tac 


egg 


gaa 


aac 


gcc 


48 


33 


Met 


Glu 


Ala 


Trp 


Arg 


Lys 


Ala 


Leu 


Leu 


Ala 


Trp 


Tyr 


Arg 


Glu 


Asn 


Ala 




34 


1 








5 










10 










15 






36 


cgc 


ccc 


etc 


ccc 


tgg 


egg 


ggg 


gag 


aag 


gac 


cct 


tac 


cgc 


gtc 


ctg 


gtc 


96 


37 


Arg 


Pro 


Leu 


Pro 


Trp 


Arg 


Gly 


Glu 


Lys 


Asp 


Pro 


Tyr 


Arg 


Val 


Leu 


Val 




38 








20 










25 










30 








40 


tec 


gag 


gtc 


ctt 


ctg 


cag 


cag 


acc 


egg 


gtg 


gag 


cag 


gcc 


etc 


ccc 


tat 


144 


41 


Ser 


Glu 


Val 


Leu 


Leu 


Gin 


Gin 


Thr 


Arg 


Val 


Glu 


Gin 


Ala 


Leu 


Pro 


Tyr 




42 






35 










40 










45 










44 


tac 


cgc 


cgc 


ttt 


ctg 


gag 


cgc 


ttt 


ccc 


acc 


ctg 


aag 


gcc 


ctg 


gcc 


gcg 


192 


45 


Tyr 


Arg 


Arg 


Phe 


Leu 


Glu 


Arg 


Phe 


Pro 


Thr 


Leu 


Lys 


Ala 


Leu 


Ala 


Ala 




46 




50 










55 










60 












48 


get 


tec 


ctg 


gaa 


gag 


gtc 


ctt 


agg 


gtc 


tgg 


cag 


ggg 


gcg 


ggc 


tac 


tac 


240 


49 


Ala 


Ser 


Leu 


Glu 


Glu 


Val 


Leu 


Arg 


Val 


Trp 


Gin 


Gly 


Ala 


Gly 


Tyr 


Tyr 




50 


65 










70 










75 










80 




52 


egg 


egg 


gcg 


gaa 


cac 


etc 


cac 


cgc 


ctg 


gcc 


cga 


age 


gtg 


gag 


gag 


ctt 


288 


53 


Arg 


Arg 


Ala 


Glu 


His 


Leu 


His 


Arg 


Leu 


Ala 


Arg 


Ser 


Val 


Glu 


Glu 


Leu 




54 










85 










90 










95 






56 


ccc 


ccg 


age 


ttc 


gec 


gag 


ctt 


egg 


ggg 


ctt 


cct 


ggt 


etc 


ggg 


cct 


tac 


336 


57 


Pro 


Pro 


Ser 


Phe 


Ala 


Glu 


Leu 


Arg 


Gly 


Leu 


Pro 


Gly 


Leu 


Gly 


Pro 


Tyr 




58 








100 










105 










110 








60 


acc 


gcg 


gcg 


gcg 


gtg 


gcc. 


tec 


ate 


gcc 


ttc 


ggg 


gag 


egg 


gtg 


gcg 


gcg 


384 


61 


Thr 


Ala 


Ala 


Ala 


Val 


Ala 


Ser 


He 


Ala 


Phe 


Gly Glu 


Arg 


Val 


Ala 


Ala 




62 






115 










120 










125 
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64 


gtg gac 


ggg aac 


gtc 


egg agg 


gtc 


etc 


tec 


cgc 


etc 


ttc 


gec 


egg 


gaa 


432 


65 


Val Asp 


Gly Asn 


Val 


Arg Arg 


Val 


Leu 


Ser 


Arg 


Leu 


Phe 


Ala 


Arg 


Glu 




66 


130 






135 










140 










68 


age ccc 


aag gag 


aag 


gag ctt 


ttc 


gec 


etc 


gec 


cag 


ggc 


etc 


etc 


ccc 


480 


69 


Ser Pro 


Lys Glu 


Lys 


Glu Leu 


Phe 


Ala 


Leu 


Ala 


Gin 


Gly Leu 


Leu 


Pro 




70 


145 






150 








155 










160 




72 


gag ggc 


gtg gac 


ccg 


ggg gtg 


tgg 


aac 


cag 


gee 


etc 


atg 


gag 


etc 


ggg 


528 


73 


Glu Gly Val Asp 


Pro 


Gly Val 


Trp 


Asn 


Gin 


Ala 


Leu 


Met 


Glu 


Leu 


Gly 




74 






165 








170 










175 




76 


gec acg 


gtc tgc 


ctg 


ccg aaa 


egg 


ccc 


cgt 


tgc 


ggg 


gec 


tgc 


ccc 


eta 


576 


77 


Ala Thr 


Val Cys 


Leu 


Pro Lys 


Arg 


Pro 


Arg 


Cys 


Gly 


Ala 


Cys 


Pro 


Leu 




78 




180 








185 










190 








80 


ggg gec 


ttc tgc 


egg 


ggg aag 


gag 


gee 


ccc 


ggg 


cgc 


tac 


ccc 


gcg 


ccc 


624 


81 


Gly Ala 


Phe Cys 


Arg Gly Lys 


Glu 


Ala 


Pro 


Gly Arg 


Tyr 


Pro 


Ala 


Pro 




82 




195 






200 










205 










84 


agg aag 


cgc egg 


gcg 


aag gag 


gag 


cgc 


etc 


gtc 


gee 


etc 


gtc 


etc 


etc 


672 


85 


Arg Lys 


Arg Arg 


Ala 


Lys Glu 


Glu 


Arg 


Leu 


Val 


Ala 


Leu 


Val 


Leu 


Leu 




86 


210 






215 










220 












88 


ggg egg 


aag ggg 


gtg 


cac ctg 


gaa 


agg 


ctt 


gag 


ggg 


cgc 


ttc 


cag 


ggc 


720 


89 


Gly Arg 


Lys Gly 


Val 


His Leu 


Glu 


Arg 


Leu 


Glu 


Gly 


Arg 


Phe 


Gin 


Gly 




90 


225 






230 








235 










240 . 




92 


etc tac 


ggc gtc 


ccc 


etc ttt 


ccc 


cct 


gag 


gag 


ctt 


ccc 


ggg 


egg 


gag 


768 


93 


Leu Tyr Gly Val 


Pro 


Leu Phe 


Pro 


Pro 


Glu 


Glu 


Leu 


Pro 


Gly Arg Glu 




94 






245 








250 










255 






96 


gcg gec 


ttc ggg 


gtg 


agg tct 


agg 


ccc 


eta 


ggc 


gag 


gtg 


cgc 


cac 


gee 


816 


97 


Ala Ala 


Phe Gly 


Val 


Arg Ser 


Arg 


Pro 


Leu 


Gly 


Glu 


Val 


Arg 


His 


Ala 




98 




260 








265 










270 








100 


etc acc 


cac egg 


agg 


ctt cgc 


gtg 


gag 


gtg 


egg 


ggg 


gec 


Ctt 


tgg 


gaa 


864 


101 


Leu Thr 


His Arg Arg 


Leu Arg 


Val 


Glu 


Val 


Arg 


Gly 


Ala 


Leu 


Trp 


Glu 




102 




275 






280 










285 








104 


ggg gag 


ggg gag 


gac 


ccc tgg 


aag 


agg 


ccc 


eta 


ccc 


aag 


etc 


atg 


gag 


912 


105 


Gly Glu 


Gly Glu Asp 


Pro Trp 


Lys 


Arg 


Pro 


Leu 


Pro 


Lys 


Leu 


Met 


Glu 




106 


290 






295 










300 












108 


aag gtg 


etc cgc 


aag 


gcg ctt 


ccc 


etc 


etc 


get 


cat 


gcg 


ggc 


gta 


gtc 


960 


109 


Lys Val 


Leu Arg 


Lys 


Ala Leu 


Pro 


Leu 


Leu 


Ala 


His 


Ala Gly 


Val 


Val 




110 


305 






310 








315 










320 




i i -) 
±±z 


ccc etc 


ccg gac 


gca 






















975 


113 


Pro Leu 


Pro Asp 


Ala 
























114 






325 
























117 


<210> SEQ ID NO 


: 2 
























118 


<211> LENGTH: 325 
























119 


<212> TYPE: PRT 


























120 


<213> ORGANISM: 


Thermus thermophilus 
















122 


<4 00> SEQUENCE : 


2 
























124 


Met Glu 


Ala Trp 


Arg 


Lys Ala 


Leu 


Leu 


Ala 


Trp 


Tyr 


Arg 


Glu 


Asn 


Ala 




125 


1 




5 








10 








15 






128 


Arg Pro 


Leu Pro 


Trp 


Arg Gly Glu 


Lys 


Asp 


Pro 


Tyr 


Arg 


Val 


Leu 


Val 




129 




20 








25 










30 








132 


Ser Glu 


Val Leu 


Leu 


Gin Gin 


Thr 


Arg 


Val 


Glu 


Gin 


Ala 


Leu 


Pro 


Tyr 
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133 



35 

Arg Phe Leu Glu Arg 
55 

Leu Glu Glu Val Leu 
70 

Ala Glu His Leu His 
85 

Ser Phe Ala Glu Leu 
100 



40 45 



13 6 Tyr Arg 
137 50 

140 Ala Ser 

141 65 



Arg Val Trp Gin Gly Ala Gly Tyr Tyr 



Phe Pro Thr Leu Lys Ala Leu Ala Ala 



75 80 



60 



144 Arg Arg 
145 



Arg Leu Ala Arg Ser Val Glu Glu Leu 



90 95 



14 8 Pro Pro 
149 



Arg Gly Leu Pro Gly Leu Gly Pro Tyr 




152 


Thr Ala 


Ala Ala 


Val 


Ala 


Ser 


He 


Ala 


Phe 


Gly Glu Arg 


Val 


Ala 


Ala 


153 




115 








12 0 






125 








156 


Val Asp 


Gly Asn 


Val 


Arg Arg 


Val 


Leu 


Ser 


Arg Leu Phe 


Ala 


Arg 


Glu 


157 


130 








135 








140 






160 


Ser Pro 


Lys Glu 


Lys 


Glu 


Leu 


Phe 


Ala 


Leu 


Ala Gin Gly 


Leu 


Leu 


Pro 


161 


145 






150 










155 






160 


164 


Glu Gly 


Val Asp 


Pro 


Gly Val 


Trp 


Asn 


Gin 


Ala Leu Met 


Glu 


Leu 


Gly 


165 






165 










170 






175 


168 


Ala Thr 


Val Cys 


Leu 


Pro 


Lys 


Arg 


Pro 


Arg 


Cys Gly Ala 


Cys 


Pro 


Leu 


169 




180 










185 






190 






172 


Gly Ala 


Phe Cys 


Arg 


Gly Lys 


Glu 


Ala 


Pro 


Gly Arg Tyr 


Pro 


Ala 


Pro 


173 




195 








200 






205 








176 


Arg Lys 


Arg Arg 


Ala 


Lys 


Glu 


Glu 


Arg 


Leu 


Val Ala Leu 


Val 


Leu 


Leu 


177 


210 








215 








220 








180 


Gly Arg 


Lys Gly Val 


His 


Leu 


Glu 


Arg 


Leu 


Glu Gly Arg 


Phe 


Gin 


Gly 


181 


225 






230 










235 






240 


184 


Leu Tyr 


Gly Val 


Pro 


Leu 


Phe 


Pro 


Pro 


Glu 


Glu Leu Pro 


Gly Arg Glu 


185 






245 










250 






255 




188 


Ala Ala 


Phe Gly Val 


Arg 


Ser 


Arg 


Pro 


Leu 


Gly Glu Val 


Arg 


His 


Ala 


189 




260 










265 






270 






192 


Leu Thr 


His Arg Arg 


Leu 


Arg 


Val 


Glu 


Val 


Arg Gly Ala 


Leu 


Trp 


Glu 


193 




275 








280 






285 






196 


Gly Glu 


Gly Glu Asp 


Pro 


Trp 


Lys 


Arg 


Pro 


Leu Pro Lys 


Leu 


Met 


Glu 


197 


290 








295 








300 








200 


Lys Val 


Leu Arg 


Lys 


Ala 


Leu 


Pro 


Leu 


Leu 


Ala His Ala 


Gly Val 


Val 


201 


305 






310 










315 






320 


204 


Pro Leu 


Pro Asp 


Ala 




















205 






325 





















208 <210> SEQ ID NO: 3 

209 <211> LENGTH: 1998 

210 <212> TYPE: DNA 

211 <213> ORGANISM: Thermus thermophilus 

214 <220> FEATURE: 

215 <221> NAME/KEY: CDS 

216 <222> LOCATION: (1)..(1998) 

218 <400> SEQUENCE: 3 

219 atg agg gac egg gtc cgc tgg egg gtg ctt tec etc cct ccc etc gec 48 
22 0 Met Arg Asp Arg Val Arg Trp Arg Val Leu Ser Leu Pro Pro Leu Ala 

221 1 5 ■ 10 15 

22 3 cag tgg egg gag gtg atg gcg gec ttg gag gtg ggg ccg gag gec gec 96 
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224 Gin Trp Arg Glu Val Met Ala Ala Leu Glu Val Gly Pro Glu Ala Ala 

225 20 25 30 

227 ctg gcc tac tgg cac egg ggc ttt agg cgc aag gag gac ctg gac ccc 144 

228 Leu Ala Tyr Trp His Arg Gly Phe Arg Arg Lys Glu Asp Leu Asp Pro 

229 35 40 45 

231 ccc etc gcc etc ctt ccc etc aag ggc ctg agg gag gcg gcg gcc etc 192 
2 32 Pro Leu Ala Leu Leu Pro Leu Lys Gly Leu Arg Glu Ala Ala Ala Leu 
233 50 55 60 

235 ctg gag gag gcg etc cgc cag ggg aag egg ate cgc gtc cac ggg gac 240 
23 6 Leu Glu Glu Ala Leu Arg Gin Gly Lys Arg He Arg Val His Gly Asp 
237 65 70 75 80 

239 tac gac gcc gac ggg etc acg ggc acg gcc ate ctg gtt egg ggc etc 288 

240 Tyr Asp Ala Asp Gly Leu Thr Gly Thr Ala lie Leu Val Arg Gly Leu 

241 85 90 95 

243 gcc gcc ttg ggc gcc gac gtc cac ccc ttc ate ccc cac egg ctg gag 336 

244 Ala Ala Leu Gly Ala Asp Val His Pro Phe He Pro His Arg Leu Glu 

245 100 105 110 

247 gaa ggg tac ggg gtg ctg atg gag egg gtt ccc gag cac etc gag gcc 384 

248 Glu Gly Tyr Gly Val Leu Met Glu Arg Val Pro Glu His Leu Glu Ala 

249 115 120 125 

251 teg gac etc ttc etc ace gtg gac tgc ggg ate acg aac cac gcc gag 432 

252 Ser Asp Leu Phe Leu Thr Val Asp Cys Gly He Thr Asn His Ala Glu 

253 130 135 140 

255 etc agg gag ctt ttg gaa aac ggg gtg gag gtg ate gtc acc gac cac 480 

256 Leu Arg Glu Leu Leu Glu Asn Gly Val Glu Val He Val Thr Asp His 

257 145 150 155 160 

25 9 cac acc ccc ggc aag acc cct tec ccc ggc etc gtg gtc cac ccc gcc 52 8 

26 0 His Thr Pro Gly Lys Thr Pro Ser Pro Gly Leu Val Val His Pro Ala 
261 165 170 175 

263 etc acc ccg gac ctt aag gag aag ccc acg ggg gcg ggg gtg gtc ttc 576 

264 Leu Thr Pro Asp Leu Lys Glu Lys Pro Thr Gly Ala Gly Val Val Phe 

265 180 185 190 

267 etc etc etc tgg gcc etc cac gag cgc ctg ggc ctt ccc cca ccc ctg 624 
26 8 Leu Leu Leu Trp Ala Leu His Glu Arg Leu Gly Leu Pro Pro Pro Leu 
269 195 200 205 

271 gag tac gcc gac etc gcc gcg gtg ggc acc ate gcc gac gtg gcc ccc 672 

272 Glu Tyr Ala Asp Leu Ala Ala Val Gly Thr He Ala Asp Val Ala Pro 

273 210 215 220 

2 75 ctt tgg ggc tgg aac egg gcc ttg gtg aag gag ggc ctg gcc cgc ate 72 0 

2 76 Leu Trp Gly Trp Asn Arg Ala Leu Val Lys Glu Gly Leu Ala Arg He 
277 225 230 235 240 

279 ccc gcc tec tec tgg gtt ggg etc agg ctt ctg gcc gag gcg gtg ggg 768 

280 Pro Ala Ser Ser Trp Val Gly Leu Arg Leu Leu Ala Glu Ala Val Gly 

281 245 250 255 

2 83 tac acg ggg aag gcg gtg gag gtg gcc ttc cgc ate gcc ccc egg ate 816 

284 Tyr Thr Gly Lys Ala Val Glu Val Ala Phe Arg He Ala Pro Arg lie 

285 260 265 270 

2 87 aac gcg gca age cgc etc ggg gag get gag aag gcc eta agg etc etc 864 
2 88 Asn Ala Ala Ser Arg Leu Gly Glu Ala Glu Lys Ala Leu Arg Leu Leu 



file://C:\CRF4\Outhold\VsrI938901A.htm 



9/28/04 



Page 5 of 8 



RAW SEQUENCE LISTING DATE : 09/28/2004 

PATENT APPLICATION: US/0 9/93 8 , 901A TIME: 10:38:43 

Input Set : A: \11283 -013001. ST25 . txt 
Output Set: N:\CRF4\09282004\l938901A.raw 

289 275 280 285 

291 etc acc gac gac gcg gec gag gec cag gec etc gtg ggg gaa etc cac 912 
2 92 Leu Thr Asp Asp Ala Ala Glu Ala Gin Ala Leu Val Gly Glu Leu His 
293 290 295 300 

295 egg ctg aac gec cgc cgc cag acc ctg gag gag gee atg etc agg aag 960 

296 Arg Leu Asn Ala Arg Arg Gin Thr Leu Glu Glu Ala Met Leu Arg Lys 

297 305 310 315 320 

299 etc etc ccc cag gcg gac ccc gag gec aag gec ate gtc etc ctg gac 1008 

300 Leu Leu Pro Gin Ala Asp Pro Glu Ala Lys Ala He Val Leu Leu Asp 

301 325 330 335 

303 ccc gag ggg cac ccg ggg gtg atg ggc ate gtg gcg age cgc ate ctg 1056 

304 Pro Glu Gly His Pro Gly Val Met Gly He Val Ala Ser Arg He Leu 

305 340 345 350 

307 gag gec acc etc egg ccc gtc ttc ctg gtg gec cag ggc aag ggg acg 1104 

308 Glu Ala Thr Leu Arg Pro Val Phe Leu Val Ala Gin Gly Lys Gly Thr 

309 355 360 365 

311 gtg egg age etc gec ccc ate age gec gtg gag gec eta agg age gee 1152 

312 Val Arg Ser Leu Ala Pro He. Ser Ala Val Glu Ala Leu Arg Ser Ala 

313 370 J 375 380 

315 gag gac ctt ttg ttg cgc tac ggg ggg cac aag gag gcg gcg ggc ttc 1200 

316 Glu Asp Leu Leu Leu Arg Tyr Gly Gly His Lys Glu Ala Ala Gly Phe 

317 385 390 395 40Q 

319 gec atg gac gag gec etc ttc ccc gec ttc aag gec egg gtg gag gee 1248 

320 Ala Met Asp Glu Ala Leu Phe Pro Ala Phe Lys Ala Arg Val Glu Ala 

321 405 410 415 

323 tac gec gec cgc ttc ccc gac ccc gtg cgc gag gtg gec ctt ttg gac 1296 

324 Tyr Ala Ala Arg Phe Pro Asp Pro Val Arg Glu Val Ala Leu Leu Asp 

325 420 425 430 

327 ctg ctt ccg gag ccc ggc etc etc ccc cag gtc ttc egg gag etc gec 1344 
32 8 Leu Leu Pro Glu Pro Gly Leu Leu Pro Gin Val Phe Arg Glu Leu Ala 
329 435 440 445 

331 ctt ttg gag ccc tac ggc gag gga aac ccc gag ccc etc ttc etc etc 1392 

332 Leu Leu Glu Pro Tyr Gly Glu Gly Asn Pro Glu Pro Leu Phe Leu Leu 

333 450 455 460 

335 ttc ggc gec ccg gag gag gec egg cgc etc ggg gag ggc cgc cac etc 1440 

336 Phe Gly Ala Pro Glu Glu Ala Arg Arg Leu Gly Glu Gly Arg His Leu 

337 465 470 475 480 

339 gec ttc cgc ctg aag ggg gtg egg gtc ctg gec tgg aaa cag ggg gac 1488 

340 Ala Phe Arg Leu Lys Gly Val Arg Val Leu Ala Trp Lys Gin Gly Asp 

341 485 490 495 

343 etc gec ctg ccc ccg gag gtg gag gtg gcg ggc etc etc age gaa aac 1536 

344 Leu Ala Leu Pro Pro Glu Val Glu Val Ala Gly Leu Leu Ser Glu Asn 

345 500 505 510 

347 gec tgg aac ggc cac etc gec tac gag gtc cag gcg gtg gac ctg cga 1584 

348 Ala Trp Asn Gly His Leu Ala Tyr Glu Val Gin Ala Val Asp Leu Arg 

349 515 520 525 

351 aag cca gag gcg ctg gag ggc ggg ate gcg ccc ttc gec tac ccc ctg 1632 

352 Lys Pro Glu Ala Leu Glu Gly Gly He Ala Pro Phe Ala Tyr Pro Leu 

353 530 535 540 
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Please Note: 

Use of n and/or Xaa have been detected in the Sequence Listing. Please review the 
Sequence Listing to ensure that a corresponding explanation is presented in the <220> 
to <223> fields of each sequence which presents at least one n or Xaa. 

Seq#:19; N Pos. 26 
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L:1408 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:19 after pos . : 0 
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